Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies.

Authors

  • J van Helden
  • B André
  • J Collado-Vides
Abstract

We present here a simple and fast method for isolating DNA binding sites for transcription factors from families of coregulated genes, with results illustrated in Saccharomyces cerevisiae. Although conceptually simple, the algorithm proved efficient for extracting, from most of the yeast regulatory families analyzed, the upstream regulatory sequences which had been previously found by experimental analysis. Furthermore, putative new regulatory sites are predicted within upstream regions of several regulons. The method is based on the detection of over-represented oligonucleotides. A distinctive feature of this approach is that it defines the statistical significance of a site based on tables of oligonucleotide frequencies observed in all non-coding sequences from the yeast genome. In contrast with heuristic methods, this oligonucleotide analysis is rigorous and exhaustive. Its range of detection is, however, limited to relatively simple patterns: short motifs with a highly conserved core. These features seem to be shared by a good number of regulatory sites in yeast. This and similar methods should be increasingly required to identify unknown regulatory elements within the numerous new coregulated families resulting from measurements of gene expression levels at the genomic scale. All tools described here are available on the web at http://copan.cifn.unam.mx/Computational_Biology/yeast-tools
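The core idea in the abstract (count oligonucleotides in the upstream regions of a coregulated family, then compare against frequencies observed in a non-coding background) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not state which statistic is used, so the binomial tail probability, the add-one pseudocount, the default word length k=6, the threshold alpha, and all function names below are assumptions made for illustration only.

```python
from collections import Counter
from math import exp, lgamma, log


def count_oligos(seqs, k):
    """Count every overlapping k-mer across a list of DNA sequences."""
    counts = Counter()
    for seq in seqs:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def binomial_tail(n, x, p):
    """P(X >= x) for X ~ Binomial(n, p), summed in log space to avoid overflow."""
    total = 0.0
    for j in range(x, n + 1):
        log_pmf = (lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
                   + j * log(p) + (n - j) * log(1 - p))
        total += exp(log_pmf)
    return total


def overrepresented(family_seqs, background_seqs, k=6, alpha=0.01):
    """Return (oligo, observed, expected, p_value) tuples, most significant first.

    Background frequencies come from `background_seqs` (standing in for the
    non-coding sequences of the genome). The add-one pseudocount is an
    assumption of this sketch so that unseen oligos get a non-zero frequency.
    """
    bg = count_oligos(background_seqs, k)
    bg_total = sum(bg.values())
    fam = count_oligos(family_seqs, k)
    n = sum(fam.values())  # number of k-mer positions in the family
    hits = []
    for oligo, observed in fam.items():
        p = (bg.get(oligo, 0) + 1) / (bg_total + 4 ** k)
        p_value = binomial_tail(n, observed, p)
        if p_value < alpha:
            hits.append((oligo, observed, n * p, p_value))
    return sorted(hits, key=lambda h: h[3])
```

Calling `overrepresented(upstream_seqs, noncoding_seqs)` on two lists of sequence strings returns the candidate oligonucleotides ranked by p-value. The sketch scans a single strand and applies no correction for the number of oligonucleotides tested, both of which a real analysis would need to handle.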

Similar resources

Finding regulatory sites from statistical analysis of nucleotide frequencies in the upstream region of eukaryotic genes

We discuss two new approaches to extract relevant biological information on the Transcription Factors (and in particular to identify their binding sequences) from the statistical distribution of oligonucleotides in the upstream region of the genes. Both methods are based on the notion of a “regulatory network” responsible for the various expression patterns of the genes. In particular we co...

Dynamical Analysis of Yeast Cell Cycle Using a Stochastic Markov Model

Introduction: The cell cycle network is responsible for the control, growth and proliferation of cells. The relationship between the cell cycle network and cancer emergence, and the complex reciprocal interactions between genes/proteins, call for computational models to analyze this regulatory network. Ample experimental data confirm the existence of random behaviors in the interactions between gene...

Computational identification of transcription factor binding sites by functional analysis of set of genes sharing overrepresented upstream motifs in yeast S.cerevisiae

Transcriptional regulation is a key mechanism in the functioning of the cell, and is mostly effected through transcription factors binding to specific recognition motifs located upstream of the coding region of the regulated gene. The computational identification of such motifs is made easier by the fact that they often appear several times in the upstream region of the regulated genes. In this...

Distribution of Allele Frequencies at 5′-Flanking Region of CYP19 and ERα Genes between Iranian Simmental and Three Indigenous Cattle Breeds

This study was performed to investigate two polymorphic sites in the CYP19 gene (PvuII and MspI) and one polymorphic site in the ERα gene (SnaBI) in four cattle breeds: Mazandarani, Taleshi, Sistani and Simmental. Overall, 278 samples for the CYP19 and 206 samples for the ERα marker sites were genotyped using the polymerase chain reaction-single-strand conformation polymorphism (PCR-RFLP) procedure. ...

Journal title:
  • Journal of molecular biology

Volume 281, Issue 5

Pages: -

Publication date: 1998